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Asymptotically efficient estimation of a scale parameter in 
Gaussian time series and closed-form expressions for the Fisher 

information 

T. Sabel* and J. Schmidt-Hieber 1 '* 



Abstract 

Mimicking the maximum likelihood estimator, we construct first order Cramer-Rao 
efficient and explicitly computable estimators for the scale parameter a 2 in the model 
Zi.n = <jn~P Xi + Yi, i = 1, . . . , n, (3 > with independent, stationary Gaussian processes 
(Xi)i g N, (i'i)ieN, and (A^)i 6 N exhibits possibly long-range dependence. In a second part, 
closed-form expressions for the asymptotic behavior of the corresponding Fisher infor- 
mation are derived. Our main finding is that depending on the behavior of the spectral 
densities at zero, the Fisher information has asymptotically two different scaling regimes, 
which are separated by a sharp phase transition. The most prominent example included 
in our analysis is the Fisher information for the scaling factor of a high-frequency sample 
of fractional Brownian motion under additive noise. 
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1 Introduction 



Let (Xi)i(zfq and (li)igN be independent Gaussian processes with known distribution. Suppose 
that we observe Z := Z n := (Zi_ n , . . . , Z n ^ n ) with 

Zi,n = crrT^Xi + Y i: i = l,...,n, and /3,cr>0. (1.1) 

In our framework, the parameter (3 is assumed to be known. We are interested in the case 
where (Xj)j g N is stationary and (li)ieN is a noise process. Our theory includes white noise 
and increments of white noise as special cases for (li)jgN (cf. Assumption [2]) . The problem, 
which we address in this work, is asymptotically optimal estimation of the scale parameter 
a 2 . In order to understand its asymptotic properties the key ingredient is knowledge of the 
Fisher information, for which closed-form expressions will be derived as well. 

Our study is motivated by estimation of the variance a 2 of a fractional Brownian motion 
(fBM) (B^) t >o at time points i/n, i = 1, ... ,n under additive Gaussian white noise (WN), 
i.e. 

V itn :=aB§ n + T€i, i = l,...,n. (1.2) 

Here, H refers to the Hurst index (or self-similarity parameter) and (ej)j is a sequence of i.i.d. 
standard normal random variables. This model has attracted a lot of attention, recently (cf. 
Gloter and Hoffmann |14[ I15| and for the special case H = 1/2, cf. Stein Gloter and 
Jacod |16l I17j . as well as Cai et al. [7]). Let us call it the fBM+WN model and note that 
the increment vector is of type (|1.1|) with /3 = H. This shows that models (|1.1|) and (|1.2|) 
coincide, if X, and Y{ are chosen as the increments of n H (B^ n — B^^,) and r(ej |n — ej_i )ri ), 
with eo, n := 0, respectively. 

Estimation of a 2 (and H) was discussed in slightly more general settings than the fBM+WN 
model by Gloter and Hoffmann |14[ [To] . In these papers it was proven that for H > ^ the 
optimal rate of convergence for a 2 is n~ l K AHJr2 \ More extensively studied and of particular 
interest is the case H = i, due to its applications to high-frequency modeling of stock returns. 
For this case, the asymptotic Fisher information is known to be n2 (8ro" 3 ) _1 (1 + o(l)) (cf. 
Gloter and Jacod [16j [T7] , and Cai et al. [7] ) . This result had a big impact as a benchmark for 
estimation of the integrated volatility (cf. Barndorff-Nielsen et al. pQ, Podolskij and Vetter 
[21~] . Jacod et al. [19] . and Zhang [28]) as well as for the asymptotic equivalence theorem by 
Reifi [23] . The fact that the multiplicative inverse of the asymptotic Fisher information is 
linear in r and proportional to the cube of a is surprising and requires further understanding. 

The main contribution of our work to the existing literature is that for < H < 1, the Fisher 
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information I\ for estimation of a 2 in the fBM+WN model is given by 

1 8H+2 2 1 

iy= n 2H+\ 2H+1 T 2H+1 CH + (j l 2H+l) i (1.3) 

where ch is a constant only depending on H (for an explicit expression of ch, cf. Corollary 
P). 

In general, we focus on the situation, where the Fisher information converges to infinity for 
n — > 00, which corresponds to consistent estimation of a 2 . Note that in view of n-^Xi = 
O p {n~P) and Yi = O p (l) it is not clear at all that there are such situations. In fact, the rate 
at which the Fisher information tends to infinity can be rather unexpected. In a first place, 
one might guess that the optimal rate of convergence for estimation of the "parameter" a 2 n~ 2/3 
is n~2 and hence the Fisher information of a 2 should be of the order ra 1-4 ^ (corresponding 
to the rate of convergence ra 2 ^ -1 / 2 ). However, this heuristic reasoning is in general not true 
and better rates can be obtained, as for instance in (|1.3|) , Surprisingly, the asymptotic Fisher 
information has two different scaling regimes. In fact we will see that for any pair {Xi)i and 
(Yi)i there is a positive characteristic such that (up to sub-polynomial factors) J" 2 oc n 1- ^ 
if < 4 and 7" 2 oc n 1-4 ' 3 if > 4. The latter appears to be the same rate as in our heuristic 
argument above. Altogether, the different scaling behavior becomes visible as elbow effect in 
the convergence rate of a 2 . As a curious fact, let us mention that the spectral densities of the 
processes do not need to be known explicitly in order to compute the proposed estimator or 
the asymptotic Fisher information. 

It is a classical result that if we observe a sample of a stationary Gaussian process with a spec- 
tral density h(6, •), the asymptotic Fisher information 1$ for estimation of a one-dimensional 
parameter 9 is given by (cf. Davies [TU] and Dzhaparidze [IT] for the general case as well 
as Fox and Taqqu [T2], Dahlhaus [S], Giraitis and Surgailis [33] for long-range dependent 
processes) 

I e=7T I" {d e logh(9,\)) 2 d\ + o(n). (1.4) 

^ JO 

In Theorem [21 we prove that under fairly general conditions on (Xi)i, a result of the type 
(|1.4p holds for 6 = a 2 in model One should note that our setting is non-standard 

and not covered within the existing literature. In contrast, due to the factor n~^, we cannot 
work with a fixed h but rather have to consider a sequence of spectral densities {h n ) n with 
degenerate limit. Furthermore, we are not in the classical parametric estimation setting, i.e. 
I™ 2 may diverge with a rate which is much slower than n. As for example in (|1.3|) . we need 
therefore to prove (|1.4|) with an approximation error which is of smaller order than o(n). This 
in turn implies that very precise control on the (large) noise process {Yi)i has to be imposed 
(cf. Assumption [5]) . Let us also mention that we cover both cases, long and short-range 
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dependence of (-Xj)j. In particular this allows to treat model (jl.2p for all H G (0, 1). 

The work is organized as follows. In Section 12, 1\ we construct the estimator and investigate 
its theoretical properties. Closed-form expressions for the Fisher information are derived in 
Section 12.21 In particular, we give some heuristic arguments why different scaling regimes 
appear. To illustrate the results some examples are provided in Section Proofs are deferred 
to the appendix. 

Notation: We write X := X n := (X 1 ,...,X n ), Y := Y n := (Y 1; . . . ,Y n ) and Z := Z„ := 
(Z ltTl , ... , Z n)n ). For two sequence (a k ) k and (b k ) k , we say that a k ~ b k iff lim fe ^oo a k /b k = 1. 
Similar, for two functions g\ and 52, we write 51(A) ~ 52(A) (for A | 0) iff lim^ fl'i(A)/fl , 2(A) = 
1. 



2 Main results 

Let U = U n be an n-dimensional, centered Gaussian vector with positive definite covariance 
matrix Eg, depending on a one-dimensional parameter 9 £M. The log-likelihood function is 

7? 1 1 

L{u\6) = --log(27r) - -log(|Sg|) - -u%-y 

with I Eg I the determinant of Eg. Let <9gEg denote the entrywise derivative of Eg with re- 
spect to 6 (which we assume to exist). Since <9glog(|Eg|) = ta^E^dgEg) and dgE^ 1 = 
— Eg 1 (i9gEg)Eg , we find for the score function 

L(V\9) := d e L{\J\6) = ~ tr(E, ^gEg) + ^U*E e 1 (9gEg)E e 1 U. 
1/2 

In distribution, U = E e ^ for an n-dimensional standard normal vector £. Together with 
some algebra this shows that the Fisher information for 8 is 1$ = \ t r ([(^Eg^^ 1 ] 2 ) (cf. also 
Porat and Friedlander [22]). In particular, for model we obtain 

/- 2 = itr([n- 2/3 Cov(X)Cov(Z)- 1 ] 2 ). (2.1) 

To simplify the notation, we will view I" 2 in the following always as a sequence in n. 

2.1 An asymptotically Cramer-Rao efficient estimator 

In this section, we construct an explicitly computable estimator which mimics the MLE. 
Furthermore, we prove that the mean squared error (MSE) of this estimator is first order 
optimal (cf. Theorem [TJ. All the results in this section, work under fairly general conditions. 
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In fact, we only require that Cov(X) and Cov(Y) are known and positive definite for all n 
as well as divergence of the Fisher information. In particular, we neither have to impose 
stationarity on or {Yi)i nor do we assume that Cov(X) and Cov(Y) have the same set 

of eigenvectors. 

The construction will be done in several steps. First, we can find n x n matrices A and D such 
that Cov(Y) = A 1 A, D l D = id n (the identity), and D diagonalizes {A" 1 ) 1 Cov(X)^" 1 . Hence, 
A := (A~ 1 D) t Cov(X)(y4~ 1 D) is diagonal and the diagonal entries are denoted by \\, . . . , \ n . 
The maximum likelihood equation in the transformed model (Zi^ n , . . . , Z n) „)* := (A~ 1 D) t Z 
motivates to consider the oracle estimator 

Oracle - (2^) ^ ^-2^. + 1)2 " ° + ^ 1. (^-2^ + 1)2 ' ^ 

To verify the equality one should note that by rewriting (|'2.ip 

1 n \2„-4/3 



2 ^ (cx^.n- 2 / 3 + i; 



2 ' 

Observe that the oracle estimator (12. 2D is unbiased and attains the Cramer-Rao bound since 
Var(<7 2 racle ) = (J™ 2 ) -1 . Oracle estimators depend on the unknown quantities itself, and are 
thus not computable. Below, we derive a simple construction for a statistical estimator 
which mimics Oracle an d is asymptotically sharp. Similar as in [TJ, we use a sample splitting 
technique. First, we take a small part of the data in the transformed model, which are 
used for a preliminary estimate, say a 2 , of a 2 . In a second step, we plug a 2 into (|2.2|) . 
Discarding all indices, which were already used for a 2 gives an estimator, which as we show 
has asymptotically the same properties as Oracle • This implies then the first order Cramer-Rao 
efficiency. 

The next lemma ensures that sample splitting can be done. 
Lemma 1. For u > and B C {1, . . . , n}, let 

i \2 -4,3 

jB ._ £ A i n 

u ' 2f^ B (u\ in -W + 1) 2 ' 

T/ien there is a sequence of index sets (A n ) n with A n C {1, . . . , n} such that 



if" ->• cxd and -> 



as n — > oo. 



Proof. Note that < Cj )n := Xfn 4/3 (n 2/3 Aj + 1) 2 < 1. Consider the partial sums Sk = 
Yh=i c i,n- and observe that Sk+i — Sk < 1 as well as 5 n = If — » oo. Therefore, we can find 
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k*(n), s.t. y/If < S k .(n) < \/7? + 1. The result follows with A n = {1,.. . , fc*(n)}. □ 

Throughout this section let (^4n)n be as in the previous lemma and pick a sequence (S n ) n , 
satisfying S n < 1, S n — > 0, and 6^1^" — ¥ oo. A possible choice is 5 n = (/J 4 ' 1 ) -1 / 8 . Observe that 

V - (2IM- 1 V Kn ~ 2 ^ ~ 1} - CT 2 + (2TM- 1 V ^-^(^n-E%) 
K {2h > (n-W\i + l) 2 " ^ + ( 1 j + 1) 2 • 

has expectation <r 2 and variance bounded by (a 4 V 1)(/J 4 " )~ 1 . Now, we define the preliminary 
estimator a 2 , as the truncated version of V, 

a 2 := (V V5 n ) A5-\ (2.3) 

This allows us to construct the final estimator a 2 for a 2 . Let = {1, . . . , n} \ A n and set 



a 2 : = (2I4")- 1 £ y^fV^S = ^ + £ ^^S^S^ - (2-4) 

it4 (<J ™ ^ + !) (^ 2 n-^Ai + l) 2 

One should note the similarity to the oracle coracle as introduced in (|2.2|) , As the following 
theorem shows, a 2 has in fact the same asymptotic MSE as the oracle, implying Cramer-Rao 
efficiency. 

Theorem 1. Suppose that the Fisher information diverges and Cov(Y) is positive definite. 
The estimator a 2 defined in (12. 4D attains the Cramer-Rao bound asymptotically over every 
compact set, not containing zero, i.e. for < a m \ n < cr max < oo, 

lim sup 7" 2 • MSE(ct 2 ) = 1. 

(Tt[(T m in,crmax] 



2.2 Closed-form expressions for the Fisher information 

So far we have seen that there are estimators which are asymptotically Cramer- Rao efficient. 
However, in order to get some understanding of the asymptotics, we need to study the behavior 
of the Fisher information. In this section, we derive explicit closed-form expressions. 

To state the results, some definitions, in particular from regular variation theory, are unavoid- 
able. For the notion of quasi-monotone and slowly varying functions see the monograph [JJ. 
A positive sequence (rj)j is called O-regularly varying if for any A > 1, 

< lim^co < hm n _>.oo < oo, 

Ml Ml 
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with |_-J the Gauss bracket. For real sequences (aj)j, O-regularly varying quasi-monotonicity 
is equivalent to the existence of a positive, non-decreasing, and O-regularly varying sequence 
(rj)j such that the sequence {ajl r i)i * s decreasing. We say that a sequence (otj)j is gen- 
eral monotone if there are finite constants C, Jo, such that for any positive integer J > Jo, 
Sjdj 1 \ a j+i ~ a j\ < C| a j|- The class of general monotone sequences will be denoted by GM . 
It was introduced and studied recently by Belov [2] and Tikhonov [27] . To simplify some 
arguments, we have relaxed the original definition slightly by introducing Jo (this does not 
cause any trouble and all results on GM sequences can be transfered with obvious changes). 
In particular GM is fairly general in the sense that it includes all well-known generalizations of 
monotone sequences, such as quasi-monotonicity, regularly quasi-monotonicity, O-regularly- 
quasi-monotonicity and sequences of rest bounded variation. 

In order to deal with boundary problems (cf. the second example in Section , we assume 
that {Xi)i is only approximately stationary in the following sense. 

Assumption 1 (Assumptions on X). Suppose that there is a stationary process (X-)j and a 
process (Ri)i such that in distribution 

X i = X' i +R i , for alii GN 

and and (Ri)i have the following properties. 

(i) There is a positive and quasi-monotone slowly varying function I : (0, oo) — > M + , satis- 
fying 

^"( x )) _^ 1, x oo forallKCR (2.5) 

such that for an index a G (—1/2, 1/2), 

lk ■= Cov{X[,X[ +k ) ~ sign(-a)fc~ 2a " 1 £(fc) as k -> oo. (2.6) 



(ii) With := (X[, . . . , X' n ), R n := (Ri, ■ ■ ■ , R n ) and \\ ■ ||2 the Frobenius norm, we have 
the uniform bound 

S up||Cov(X;,R n )|| 2 + ||Cov(R n )|| 2 <oo. (2.7) 

n 

Throughout the following, we interpret the autocovariance {jk)kez as a sequence on Z via 
7^ = 7_fc. An example for a (quasi-)monotone slowly varying £ is the logarithm log(l + •). 
However, (|2.5|) does not hold for every slowly varying function. A stronger condition, which 
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implies (|2.5p is 



lim — 1) log(A) = for an a > 1 

(cf. Theorem 1 in [5]). As a consequence (|2.5|) holds whenever lim^^ £(X) £ (0, oo). More- 
over, if it is true for £ then also for £^, [i > 0. 

The n x n matrix A denotes the backward difference operator, i.e. 

/ 1 \ 
-1 1 



A 



(2.8) 



For a vector v = (v%, . . . , u n )*, Av = (vi — vo,V2 — v\, . . . , v n — v n -\) t with vq := is the 
backwards difference process. Furthermore, the transposed matrix A* is the negative forward 
difference operator A f v = —(v2 — V\, v$ — V2, ■ ■ ■ , v n+ \ — v n Y with v n+ \ := 0. We assume that 
for a non-negative integer K, the process Y is generated by taking the i^-th finite difference 
of a white noise process (alternating between forward and backward differences). 

Assumption 2. Given a non-negative integer K and r > 0, assume that Y is an n- 
dimensional, centered Gaussian random vector with covariance matrix t 2 (AA*)^ orr 2 (A*A)^ 

Assumption [5] imposes in fact a very serious restriction, but seems to be somehow unavoidable 
in order to prove the statement (cf. also the discussion in the introduction). Our results could 
be worked out under more general boundary conditions of the difference operator, of course. 
It is indeed sufficient that Cov(Y) can be perfectly diagonalized by a discrete since or cosine 
transform. However, since the assumption above is somehow the most natural one and allows 
to treat the fBM+WN model, we will restrict ourselves to it for sake of simplicity. 

Let throughout the paper / = J2t=-oo Ik cos(fe-) denote the spectral density of (X|)j e pj. 
Although X and Y are stationary only up to boundary values, we will refer occasionally to 

/, 4 K r 2 sm 2K (5), and h n = a 2 n~ 2 ^ f + A K r 2 sin 2K (j) (2.9) 

as the spectral density of the processes X, Y, and Z, respectively. Because of the imposed 
independence of and {Yi)ii h n is the sum of the spectral densities of X and Y. 



Define 



n 



(£( n — ))ax-aa + n 1 " 4/3 , if K - a ^ ± 



n i-4/3 \ g(n)l 2 (n), UK -a- 1 



(2.10) 

4 - 
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Now, we are ready to state the main results of the paper. Surprisingly, it turns out that 
the rates depend on K and a only through their (inverted) difference, i.e. the problem is 
characterized by 

K — a 

Theorem 2. Work in model (jl-l|) under Assumptions [7] and® and suppose that K — a > 
max{/3, (4a + 1)0, 1/4}. Further assume that either 

1. a £ (0, 1/2), (7fc)fc is O-regularly varying quasi-monotone, and Y^k=-oo Ik = 0, or 

2. a G (-1/4,0) and (j k ) k G GM, or 

3. a £ (—1/2,-1/4) and there exists a constant Ci, such that for any p £ N, |7 p +i — 7 P | < 
Cl\lp\P~ l - 

Then, the Fisher information of o 2 based on n observations is given by 

*=-s-j£ IH iJ(1+ ° (1))+ ° w ' (2 - u) 

If the condition K — a> max{/3, (4a + l)/3, 1/4} is replaced by the weaker assumption K — a> 
max{/3, (4a + l)/3}, imposing additionally log(n)£ 2 (n) — > oo in t/ie critical case K — a = 1/4, 
£foen (|2.1ip /iota's as we/Z, provided there exists a constant Cf, such that 

\f(X)- f{fx)\<C f X 2a - 2 \X- for all 0< X< u<tt. (2.12) 



Let 



C(0,a) := i|_^(2sign(-a)r(-2a)cos(vra)) 



Theorem 3. Under the assumptions of Theorem [IJ i/ie asymptotic Fisher information is 
explicitly given by 

~ n 1 -^(^(n^)) (>/ V0- 4 T- C«>,a), if 0< 4, (2.13) 



and 



„ 1-4/3 «r „ 1-4/3 oo 

" jf / 2 (A)dA = ^- £ 7 i, 0>4. (2.14) 



2vrr 4 

For£= llog^OI, P> -1/2, 



fc=— oo 



O ~ n^Oogn) 2 ^ 1 r- 4 ^^ 1 , (f = 4. (2.15) 
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Corollary 1. In the fBM+WN model (fL2j) with H G (0, 1), it holds that 



°=2«TT <4 

and the asymptotic Fisher information for a 2 is given by (]1.3p with 



i/sin2H+i (ttJT) r (2H + 1) 2^+1 
ch := — 2 



where T(-) denotes the Gamma function. 



Proofs of the statements are deferred to Appendix [Bl Let us conclude the section by some 
comments on Theorems [2] and El 

First, observe that (|2.11|) is of the type (|1.4p for 6 = a 2 . The surprising fact is that one 
can compute the integral Jq f 2 (X)/h 2 l (X)dX obtaining expressions which for < 4 do not 
depend on / anymore. Let us shortly explain this. Suppose that Assumption [T] holds with 
1=1. By classical results, we can conclude that for A ! 0, /(A) ~ C a X 2a and h n (X) ~ 
C a a 2 n~ 213 X 2a + t 2 X ak with C a = 2 sign(— a)T(— 2a) cos(7ra). Next, observe that for small A, 
f 2 (X)/h 2 l (X) ~ n 413 whereas for large values the integrand behaves like \ ia ~ iK . Now, < 4 
is equivalent to 4a — 4K < — 1 and in this case the integral will be determined in first order 
by f 2 (X)/h 2 l (X) for small A. Therefore, one expects that / and h n can be replaced by their 
corresponding small value approximations C a X 2a and C a a 2 n~ 2/3 X 2a +r 2 X 4K , respectively and 

y* dX « jT (1 + C- 1 a- 2 r 2 n 2 ^X 2K ~ 2a y 2 dX. 

The r.h.s. can be explicitly solved and does not depend on /. In contrast to that, for > 4, 
the Fisher information depends also asymptotically on the whole spectrum (0, ir]. This is why 
we need the additional assumption (|2.12|) which controls the continuity of / globally. 

The phase transition for = 4 does not only affect the asymptotic constant but also leads to 

an elbow phenomenon in the rate of convergence for estimation of a 2 . If < 4 the optimal 

-68-- 

rate (neglecting sub-polynomial factors in the following) is n2 2 whereas for <0 > 4, it is 
n p 2 . The latter only depends on f3. Typically, if in an estimation problem an elbow effect 
occurs there are different sources of errors which cannot be balanced and therefore the best 
attainable rate is given by the maximum of the single error rates. However, in our situation 
the optimal rate turns out to be the minimum, more precisely it is min(n2^^~2 ; n 2/3 ~2 ). 

Let us also shortly remark on the dependence of the asymptotic Fisher information on the 
scaling coefficient a. Write p = 0/4 and 9 = a 2 . Choosing Q n (r, a,§) appropriately, we find 
that the Fisher information for 9 observing U ~ N{9 p ,Q n ) if p < 1 and U ~ N(9,Q n ) if 



10 



p > 1 coincides in first order with (|2.13p - (|2.15p (standardizing X such that Y^T=-oo ll = 1 ^ 
p > 1). Hence, in an asymptotic sense our original statistical estimation problem is related to 
a Gaussian shift model where we want to estimate the p-th. power (p < 1) of the mean value. 

Note that our results also cover the case a £ (—1/2,-1/4) for which the autocovariance 
function is not (square) summable. In fact the proof turns out to be very subtle and requires 
quite restrictive conditions. In particular, we have to impose an assumption on the increments 
of the autocovariance which is much stronger than GM . 

One should also note that in the critical case = 4 an additional log-factor appears in the 
rate of convergence. In Theorem we have restricted ourselves to the (most important) 
case where t is a power of the logarithm, which allows to evaluate the asymptotic Fisher 
information in closed form. However, from the proof one can follow a slightly more general 
version, namely that under the assumptions of Theorem [2] (in particular log(n)£ 2 (n) -4 oo) 
and with q n := n~ 4/3 ^ 2 (n 4/3 ), 

jn = n l~4V-4 / ^(1)^(1 + o( l)) +o(n ^l og(n) ). 

Theorem [2] and Theorem [3] are derived for all a G (—1/2, 1/2), a 7^ 0. The case a = is 
indeed special since as a — > 0, C(0,«) —> 00. However, in the fBM+WN model (|1.2|) (recall 
that if = 1/2 — a) this phenomenon does not play a role because of £(\) =const.= H\2H — 1|, 
which converges to zero (fast enough) as if — > 1/2. This explains why ch is continuous for 
all H £ (0, 1), whereas a 1-4 C{(),a) is not. 

Besides the classical case H = 1/2, which was mentioned already in the introduction, one 
can easily simplify the asymptotic Fisher information in the fBM+WN model (|1.2p for H £ 
{1/4,3/4}. Indeed, as a consequence of Corollary [TJ we obtain for the multiplicative inverse 
(which is the asymptotic variance of our estimator) 

IT = 1/4: (#)" 
If = 1/2: (#)" 

H = 3/4: (#)" 

Finally, one should note that the elbow effect observed in Gloter and Jacod [TB] does not relate 
to our results. In fact they have studied the fBM+WN model for if = 1/2 (i.e. BM+WN), 
where the variance of the noise is allowed to depend on n. With the notation of model (|1.1|) . 
the change in the rate appears as j3 J, 0. In particular, they also discuss the case (3 < in which 
the classical n -1//2 -rate can be achieved. In our framework, j3 < corresponds to estimation 
of the scaling parameter of {Yi)i- 
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a 



8/3 r 4/3 n -2/3 „ WMa S/3 T 4/3 n -2/3 



8a 6 Tn 



-1/2 



25^5 + V5 
V2 3 7 / 5 ttV5 



CT 16/5 r 4/5 n -2/5 „ 8 .l 2(7 16/5 T 4/5 n -2/5_ 
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3 Examples 



In the introduction, we have already discussed the main example of estimating the scale 
parameter of fractional Brownian motion under Gaussian measurement noise. The solution 
is given in fjl .31) (cf. also Corollary [1]) . In order to provide some further illustration of the 
derived results, we discuss two estimation problems for which the Fisher information can be 
explicitly computed. 

Large measurement error: Let {Xi)i denote a stationary process with long-range dependence. 
More precisely, assume that for constants A, C, and self-similarity parameter H G (1/2,1), 
the autocovariance satisfies 7^ ~ Ak 2H ~ 2 , \~fk+i — 7k\ < C"fk/k and for H G (1/2,3/4), 
additionally J2T=-oo it = 1- Suppose that we observe the scaled process (Xi)i under large 
noise, i.e. 

Z i>n = aX i + Tn f3 e h i = l,...,n, < /3 < H - 1/2. 

Now, with the notation of Theorem [31 a = 1/2 — H and = 2/(2H — 1). In particular > 4 
for H G (1/2,3/4) and < 4 for H G (3/4, 1). Therefore, an elbow effect occurs at H = 3/4 
and the Fisher information is determined in first order by 

„l-4/3 

o- 2 o_4 ' c V2> 4/> 



2 r 4 ' C V2> A> 

n 1-4 ^ log(n) 

o' 2 - Ti ' — 4' 



2H-1-2/3 8H-6 2 (2AF(2# - 1) SUl^F)) , 



Integrated fractional Brownian motion: Suppose that we are interested in efficient estimation 
of a 2 given observations {Vi >n , . . . , Vn,n)-, 

V i>n = a [ l B?ds + re,, i = 1, . . . , n, e, ^ AA(0, 1), H G (0, 1/4), 

Ji/n 

and (£f )t is a fBM, which is independent of the WN. With A as in (gSD, X,- t := Bfds - 
Jov(i-2) 5 f ds ' and e = ( £ i> • • • > e n)> consider 

AA*(Ki, n , . . . , K, n )' = (cTn- 1 -^^)^!,...^ + tAAV 

and note that (Xj)«>2 is stationary. By defining R\ appropriately, it is straightforward to 
verify Assumption [TJ In particular, we find that (|2.7|) is bounded by < n AU ~ x < 1 and 
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7fc ~ H(2H - l)k IH -\ Therefore, i = H(l- 2H), a = 1/2 - H, K = 2 and j3 = 1 + H imply 



°=2«T3 <4 



and the Fisher information is given by 



i sH+io 2_ (H + 1) sin 2^+3 (ttH)T(2H + l)2H+3 

J n 2 ~ 77,2/^+3,7 2H+3 f 2H+3- 



(2tf + 3) 2 sin(^) 
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Appendix A Notation and some remarks 

Let us first give some notation. Whenever it is clear from the context, we omit the index 
n. In particular, we suppress the index n of the spectral density h n = h and the estimator 
o\ = a 2 . Inequalities for Hermitian matrices should be understand in the sense of partial 
Loewner ordering. The matrix norms || • H2 and || • ||oo denote the Frobenius and spectral 
norm, respectively. Furthermore, we write A and V for the minimum/maximum and 

2i-l . . 

Ui := Ui n := 7r . (A.l) 

' 2n+l v ; 

The last definition occurs frequently in connection with finite-dimensional approximations 
due to the transformation property of the discrete cosine transform. Let [ - J be the Gauss 
bracket. As in (|2.10p . we denote by (r n ) n the rate at which the Fisher information tends to 
infinity (this still needs to be proved, of course). For technical reasons, however, it will be 
chosen in the following as an integer sequence, i.e. 

[n l -K^ (e{nT^)) +n 1 ~ 4/3 J, if ^ 4, 
L^-^logtn^n)], if = 4. 

This definition will be used at many places throughout the proofs. In particular, one should 
keep in mind that u Tn = 0(r n /n) and as a direct consequence of (|2.5p 

Lemma 2. If £ is as in Assumption[l\ and < 4, then 

g(n K -°>) 
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The projection of a function on {cos(A>) : < k < n} is denoted by S n , i.e. if g = 
Efcl-oo 9k cos (k-) with g k = g_ k , then 

n 

S n g = ^2 9k cos (k-) . 

k=—n 

We write T n (g) for the n x n Toeplitz matrix corresponding to g, i.e. 

T n {g) = (g\i-j\)i,j=i,..., n - (A. 2) 

In particular, Cov(X') = T n (f), with / as in (|2.9|) . Let DCTg be the discrete cosine transform 

DCT s = (t5Itt cos [(< " h)U ~ *)afti]) w=w = (75fcr cos [(< " *H) W=1> ..., B 

(which is DCT-VIII in the notation of [6j). Note that DCTs = DCTg is orthonormal. Further 
introduce the matrix D n (g) as 

D n {g) :=DCT 8 -diag( 5 (ui),5M,...,3K)) • DCT 8 . (A.3) 

Asymptotically, the eigenvalues of D n (g) and T n (<7) are 'close', provided the symbol g is 
sufficiently smooth (cf. Lemma [9]). 

If Cov(Y) = T 2 (A t A) K , consider the observation vector Z which is obtained by the sufficient 
transformation Zi^ n = Z n _i^ n . These observations satisfy our assumptions with Cov(Y) = 
T 2 (AA t ) K . Therefore, without loss of generality, we can and will consider only the first 
case of Assumption [21 i.e. Cov(Y) = t 2 {AA 1 ) k . By Lemma [71 DCTs diagonalizes A*A 
and hence also Cov(Y) = r 2 (A t A) K . The eigenvalues of Cov(Y) are explicitly given by 
A K T 2 sm 2K ( Ui ^),i = l,...,n. 

For the subsequent proofs, the following three elementary inequalities turn out to be very 
useful. Firstly, from (|2.6|) and Potter's bound (cf. for instance Bingham [1]) it follows that 
for any e > there exists a ko such that for any k > k^ 

l k -2a-l-e < ijfe-2«-l£( fc ) < lk < 2k' 2a ~ X i{k) < 4fe- 2a ' 1+£ . (A.4) 

Moreover, we can find a constant C\ such that 7^ < Cik~ 2a ~ 1Jre for all k = 1,2,... Secondly, 
if /(A) ~ CrT 2 ^ \ 2a l(l/ A), then, for every e > we can find a 5 > 0, such that for all 
A G (0, S], 

ho\ 2a+t < lf\ 2a i(\) < /(A) < 2C\ 2a l{±) < AC\ 2a ~ e . (A.5) 
Additionally, under the assumptions of Theorem [51 we know that / is bounded on [5, tt] for 
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every S > and therefore the upper bound /(A) < 4Cn~ 2 P\ 2a - " e can be extended to all 
A G (0, 7r] by enlarging the constant appropriately. Finally, for all A G (0, 7r], we have 



/(A) + 4- K r~ 2 A 2 ^ < h(X) < /(A) + r- 2 A 2/f . (A.6) 



Appendix B Proofs 
B.l Proofs of Section EH 

Lemma 3. Let a 2 be as defined in (12, 3p . Given < a < a < oo, there exists an N = N(a,a) 
such that for all n > N and for all r] > 0, 

sup_ F(\a 2 - a 2 \ > {I^)' 1 ' 2 max (l,a 2 )rj) < 2e 1 / 4 ^/Vs. 

Proof. Since 5 n —¥ 0, we can choose N such that for all n > N, 5 n < a 2 and 5~ l > a 2 . Then, 
for a 2 G [<L,a], \a 2 — a 2 \ < |<T 2 re — cr 2 |. Let ax, «2, ... be a sequence of i.i.d. Xx random variables. 
By Proposition 6 in Rohde and Duembgen [23] (similar statements have been derived also 
elsewhere, for another reference see for instance Johnstone [20], p. 74), for a vector (^j) igj 4 n 
of real- valued numbers 

P(| - !)l > ^Wvhv) < 2e 1 ^/^. 

Note that in distribution, a 2 re — a 2 = J2i^A n ~~ 1) with 

Application of the exponential inequality above together with ||/i||2 < (2/^ n ) -1 / 2 max(<7 2 , 1) 
yields the result. □ 

Proof of Theorem^ Due to the independence of (Zi)i^A n and (Zi)i & A^ , the estimator a 2 is 
unbiased. In addition, using Lemma [U the theorem is proved once we have established that 

m |Var(a 2 )-E(/ffn 

rA£s_1 



and (J J) : sup CT ^prr = 
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In the following, we make frequently use of the following observation. For any set B C 
{!,-•• ,n}, 

min(l,(S) 2 K B <J u B <max(l,(H)Vf- 
Proof of (I): Writing E[-] = E[E[-|(£ „) i64 J], 



Var(a 2 ) - E(/~ 



E 



\fn-^(a 2 - a 2 ) 



(2I~")~ 2 V 

1 ° 2 ' t h i* 2 ^- 2 ? + 1) 4 



^a 2 + a 2 )X l n- 2 ^ + 2) 



< 2E 



(27^) 



^-1 



(3 + 



^-1 



where we used the inequalities 



E 



2 A c x — 

_ ]c (e 2 \n~ 2 ? + l) 4 " ? 4 - 2 a ^ (o 2 \ in - 2 P + l) 4 o 2 ^ 



1 T /\ c 

< ^4 n - 



Therefore 



sup_E[(li 2 ")- 1 



a" 1 



sup / r[|cr 2 - a 2 \ > a 2 I~}t]dt 

a£\a_,a} 

< I sup F[\a 2 -a 2 \>5 n lf n t]dt. 



Application of Lemma [3] together with Jq 00 exp(— At)dt = A 1 yields 



sup E (L 

o-e [ct,ct] 



fr 1 



(7 



(T- 



< (^)- 1/2 (^)- 1 - 



Similar, 



sup_E[(/i 2 ")- 1 

o-G[ct,<t] 



sup_ / r[|cj 2 -a 2 \ > a 2 (I^)V 2 Vi\dt 
< f sup ¥[\a 2 -a 2 \>5 n {lf n ) 1/2 Vt]dt 



and because of / °° exp(— A\/i)dt = A 



sup E 



<re cr,<7 



(7 



<7^ 



By definition 5 2 I^ n —> oo. Since for sufficiently large n, by Lemma [H 



J™2 < max(l,a- 4 )/ 1 n < 2max(l,a- 4 )/f", 



(B.l) 
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it follows that 

|Var(^)- E (lff)-i| Al A 
sup — < h sup Var(o- ) - E(J~ 2 n ) = o(l). 

Proof of (II): From Taylor expansion, we find that for positive x, y, \x~ 2 — y~ 2 \ < 2(min(x, y))~ 3 | 
y\. Therefore, we can bound 



r 4 - _ iS = - V n~ 4l3 \ 2 ( 1 1 

- 2 2 4 V (a 2 n-^Xi + l) 2 (a 2 n-^Ai + 1) 



by 



i A c A c i x — ^ 

I 2 < V 



n ~ 6 ' A * U 2 * 2 I 



_ ' (min(er 2 , cr 2 )n 2 ^Aj + 1) ; 



o z' -2 2\r A n I 2 ~2i ^ / -6 i ,u-2r-*n 2 ~2| 

2max(CT ,a )^ 2 n A ~ 2 k -°" I < (a + 1 ) (5 n J ~ 2 F -ff |. 



Thus, 



sup_ I^IE(J^)- 1 - < 6~ 2 [ sup_ P(|cx 2 - a 2 \ > t)dt < {S^y 1 ' 2 -+ 0. 

Using (jB.ip . the convergence in {II) follows and this completes the proof. □ 



B.2 Proofs of Theorem El 

The proof of the main theorem builds in a very neat way upon an elementary analytical 
observation (cf. Lemma EJ) which leads in a second step to a trace approximation for positive 
semidefinite matrices (cf. Lemma EJ). This approximation result does not require any assump- 
tion on the behavior of the smallest or largest eigenvalue. Together with a rather standard 
but slightly technical Riemann approximation argument, we can then deduce a generalized 
version of Theorem [2J 

Lemma 4. Let (x n ) n , (y n )n, (Qn)n be positive sequences such that \x n — y n \ = 0(q n (y/x~^ + ^/y^,) 
Then, x n = y n (l + o(l)) + 0(ql). 

Proof. First, assume that q n = o(^/y^). The assumption \x n — y n \ = 0{q n (^/x^ + ^/y~^) ) is 
equivalent to \^fx^ - ^fy^\ = 0(q n ) and \{x n /y n ) l l 2 — 1| = 0(q n yn 1 ^ 2 ) = o(l). This implies 
(x n /y n ) 1/2 ~ 1 and thus x n /y n ~ 1, i.e. x n = y n (l + o(l)). Now, if q n = o(^/x^) the same 
argument shows that x n {\ + o{\)) = y n (but this is of course equivalent to x n = y„(l + o(l))). 
If neither q n = o{ y /y^) nor q n = o(y^) holds, then, q n > ^/x^+^/y^and therefore, \x n -y n \ = 
0(q 2 n )- □ 
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Let A be an n x n matrix. For convenience, we introduce the notation 



(A) :=tr(A 2 ). 

Lemma 5. Let A\, A 2 , B be (sequences of) positive semidefinite, nx n matrices and suppose 
that A\ is invertible. Then, for n — > oo, 

(A 1 B) = (A 2 B)(1 + o(l)) + 0{(B(A 1 - A 2 ))). 

Furthermore, if B < A^ 1 , then 

(A X B) = (A 2 B}(1 + o(l)) + 0((id n -A^A^A^ 2 )). 

Proof. By Cauchy-Schwarz, 

| (AiB) - {A 2 B) | = | tr(B(Ai - A 2 )BA 1 ) + tv(BA 2 B(A x - A 2 ))\ 

< \\B 1 I 2 {A 1 - A 2 )B l / 2 \\ 2 [tr^B) 2 ) 1 / 2 + tr({A 2 B) 2 ) 1 / 2 

For the last inequality we have rewritten ti{B{A\ — A 2 )BA\) and ti{BA 2 B(Ai — A 2 )) as 
tiiB 1 / 2 ^- A 2 )B 1 / 2 ■ B 1 l 2 A l B 1 / 2 ) and t^{B l / 2 {A l - A 2 )B l l 2 ■ B 1 I 2 A 2 B 1 I 2 ). Since (B(A 1 — 
A 2 )) = HB 1 / 2 ^! - A 2 )B l / 2 \\l, the result follows with Lemma H 

To prove the second claim, write 

S i/2 (Al _ A2)B H2 = B l / 2 A)! 2 (i& n -A- l/2 A 2 A- 1/2 )Ay 2 B l l 2 

1/2 1/2 

and note that due to A± BA^ < id^, 

(B(Ax - A 2 )) < (id n -A~ 1/2 A 2 A- 1/2 ) = (id n -A^A^A^ 2 ). 

□ 

In the case a > the multiplicative inverse of the spectral density h has a singularity at zero. 
In order to deal with this, we introduce the regularized spectral density h, which is defined 
as follows: Let (p n ) be a sequence of positive integers satisfying p n <C r n . Then, we define 



h(X) :-- 



h(\)Vh(u Pn ), \<u Pn 
h(X), else 



with u Pn as in (jA.ip . Replacing h by /, define in the same way /. We will prove a generalized 
version of Theorem [2] for a generic sequence (p n )n- In a second step the different versions of 
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the main theorem are deduced and p n will be chosen according to the specific setting. We 
may interpret this spectral regularization as adding an asymptotically non-informative (i.e. 
sufficiently small) WN process to our observation vector. This induces some stability, which 
becomes important in the bounds for the inverse covariance matrices. 

Theorem 4. Work under Assumption [7] and Assumption^ in model Suppose that 

a £ (-1/2, 1/2) andK - a > /3 V 1/4. If 

(i) (7fc)fc>o is in GM, / is bounded on any interval [5,ir] with 5 > and there exists a 
positive, quasi-monotone slowly varying function £, such that 

/(A) ~ 2sign(-a)r(-2a)cos(vra)A 2Q ^(l/A), 

(ii) n -4/3-4a-2+ 2e YZ=M,nHui,n))- 2 = o(r„), for some e > 0, 
(Hi) (D^(h)(D n (S n f) - T n (f))) + sup A6(0)7r] 7r 2 (A) = o(r n n^). 

Then, the asymptotic Fisher information of a 2 is 

„ 1-4/3 ^TT » 2 ,,. 

O = X fegy^d + o(l)) + o(r n ). (B.2) 

// the condition K — a > f3 V 1/4 is replaced by the weaker assumption K — a > f3, imposing 
additionally log(n)£ 2 (n) — > oo in the critical case K — a = 1/4, then (|B.2|) holds, provided 
there exists a constant Cj such that 

|/(A) - f{n)\ <C f n- 2 ? X 2a ~ 2 \X-n\, for all < X< h<tt. (B.3) 

Remark 1. Later on we will see that the different parts of Theorem^ follow from Theorem 
[7} If {X[)i has long-memory, condition {Hi) turns out to be quite difficult to verify. Although 
it would be easier (and more standard) to formulate the condition with respect to squared 
Frobenius norms, let us shortly explain, why the use of the (■) notation is essential. By 
definition, (A) = tr(A 2 ) for an nxn, square matrix A, which in turn can be upper bounded by 
the squared Frobenius norm of A (cf. Lemma\^ (i)). This is even an identity if A is symmetric 
but can be very rough in general. To see this consider (A)ij = 1/i. Then (A) = log 2 n but the 
squared Frobenius norm is of order n which is much worse. Since these phenomenons occur 
in some cases, condition (Hi) is stated for squared traces. 

Proof. Recall the explicit expression for the Fisher information in (|2.1|) . The proof is subdi- 
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vided into three steps, namely 



{D n (f)D-\h)) = I / g&d\{l + o(l)) + o(r n n^), 



vr Jq 

(T n (f)D~ l (h)) = (D n (])D-\h))(l + o(l))+o(r n n^), 
2I£ = (n-^T n (f) Cov(Z)- 1 ) = <n- 2 ^T n (/) J D- 1 (/ l ))(l + (l)) +o(r n ). 

which are denoted by (I), (II) and (III), respectively. 

(I): By the trivial bound £ f 2 (X)/h 2 (X)d\ = <3(n 4/3 ) = o(r n rt 4 ^), we can replace the normal- 
ization factor n/ir by (2n + l)/(2ir). Thus, using (|A.3|) . it is sufficient to show that 



E 

i=l 



/ 2 K) _ 2n+l / / 2 (A) 

Wrt 2n Jo TO aA 



o(r n n 4/3 ). 



(B.4) 



Now, let us treat the cases K — a > 1/4 V/3 and j3 < K — a < 1/4, separately. 

If K — a > 1/4V/3 holds: Using Assumption (i) and r n <C n, we can find integer sequences 
(r+) and (r~) such that p n <C r~ <C r n <C r+ <C n and (g n + (V~) _1 )r+ = o(r n ) with 



in ■ — Hny n . 



sup 

0<A<u + 
l</i<2 



CX la i(l/X) 



/(A) 



+ 



(B.5) 



Since a 2 n 2/3 f < h and a 2 n 2 @f < h, it follows that 



£ = o(r n n^) and n Q^dX = o(r n n^). 



and together with Proposition [3J we see that in (IB.4[) the sum over i = 1, . . . , r~ and i = 
r+, . . . , n as well as the integral over (0, u r -] U [u r + , 7r] are of order o(r n ) and thus negligible. 
Thus, we have proved (I), once we have verified that 



i=r~+l 



/ 2 K) 



2n+l 
2tt 



V / 2 (A) 

7?{A) dA 



o(r n n 



4/3 n 



(B.6) 



To see this, write 



E 

i=r n +1 



/ 2 K) _ 2n+l 
h 2 (ui) 2tt 



/ 2 (A) 
TO 



< E su p 



TO) 



< 



i=r n +1 

2n 4 ^ 



(7 



E su p 



n~ 2g /K) _ n- 2 ^/fe) 
h{Ui) h(£i) 



(B.7) 



i=r n +1 
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Fix i G {r~ + 1, ...,r+} and let ai = a 2 n" 2/3 /(«;), a 2 = <r 2 n- 2/ V(&) A = 4^r 2 sin 2i ^(u l /2), 
fr 2 = 4^r 2 sin 2K (^/2),ci = /i(uj),c 2 = Since < a\ < c\ = a\ + 61 and < a 2 < c 2 = 

a 2 + 6 2 , we find 



ai a- 2 
ci c 2 



< 



|ai - a 2 | + |&i 
ci V c 2 



(B.J 



Thus, for sufficiently large n, 



\ai - a 2 \ < (01 + a 2 )g n + a 2 C7n- 2/3 |u 2 ^(J-) - ^(^)| 

< (3 ai + a 2 )g„ + a 2 Cn- 2 ^(i)| U 2Q - g a \ 

< (3oi + a 2 )<?„ + ^^Cn-^^J-)^ - 1 < (3ai + a 2 )g„ + O^)- 1 ^. (B.9) 
On the other hand, we find |&i - 6 2 | < |^4 X r 2 sin 2 ^" 1 (f ) < 8K(r-)- 1 6i, and therefore 

<4 (? „ + (6 + 8K)(r-)- 1 . 



ai a 2 
ci c 2 



Due to (q n + (r n ) 1 )r+ = o(r n ), we see by ()B.7|) that (|B.6|) is bounded by o(r n n 4/3 ). This 
completes the proof for part (I) if K — a > 1/4 V /3. 

If f3 < K — a < 1/4 : The proof is very similar to the one for the first case. Note that the 
assumptions imply r n n 4/3 > n, K = 0, and a £ [—1/4, — Similar as above we see that it 
is sufficient to prove (|B.6P for r+ = n and any sequence r~ = o{r n ). We may assume that 
r~ — > 00. Since by (IB.3jl . / and h are continuous, we can apply the mean value theorem, i.e. 
for any i there is a £j £ 1, Ui] with 



fJM _ 2n±l 



1 / 2 (A) 



TO 



(B.10) 



and 



n 

E 7?K) 



f 2 (ui) _ 2n+l 
2vr 



< E 

i=r„ +1 



h(ui)Vh(ti) 



Pick an integer sequence w n such that w n = o(n) and for some e > 0, n~ Aa+e w! l 
Let q n {wn) be as in (|B.5j) with r+ replaced by u> n . Note that since 
{(ln{w n ))n tends to zero. As in (|B.9p . we find for sufficiently large n and for all 



4a+e„,,4a-e-l 



(B.ll) 



oil). 



0, the sequence 



1 = 7V 



\f(ui) - /(&)| < (3/(«i) + /(&))<Z„K) + 2a 2 vrCr 1 - i e 



-l t 2a-l-e 



(B.12) 



For large indices, we use the estimate (O, i.e. |/(«;) - /(&)| < C/^n- 1 ^ 20 " 2 for i = 
w n + 1, . . . , n. Split the sum in (|B.lip into X)^* _ + 2~Zr=«>„+i • It is eas Y to bound the 
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second sum, using the definition of (w n ) n , which in turn implies that n 1 Ya=w +i e ^ 

n i-4a+e u; 4«-i-e _ G ( n )_ Similar, computing the sum J^ W " - > over the second term in ()B,12j) 
yields for small e, n" 1 - ^ a ~ 1 - 2e < n - 4a+2e w^ a ~ 2e = o(n). For the first term in flBUD 
we see that (|B.11|) can be further bounded by a multiple of q n (wn)(YH=i f 2 (ui)/h 2 (ui) + (2n + 
l)(2vr)- 1 Jq f 2 (\)/h 2 (\)d\). Putting all estimates together, we have derived 

i m-^c^^^m^ +<<»>■ 

i=r n +1 r n 

This finishes the proof of part (/). 

To prove (II) and (III) it will not be necessary to distinguish whether K — a > 1/4 or 
K-a< 1/4. 

(II) : In Lemma El set A x = T n (f), A 2 = D n (f) and B = D~ l (h). We have to show that 

(B(A 1 - A 2 )) = (D-\h)(T n (f) - D n (J))) = o(r n n 4 P). (B.13) 
First, note that due to (|A.3|) . a 2 n~ 2l3 f < a 2 n~ 2l3 f< h, and p n < r n , 

(D- i (h)(D n ( f ) - dm))) = t (/(M i~y ^ 2 ^ = 

pi (h( Uj )) 2 ^ 

By Assumption ('fk)k G GM and a G (—1/2,1/2). Therefore, we can use the estimate from 
Lemma E] (i) together with (|A.4p . i.e. there exists a constant C±, such that 

\f( x )-S n f(x)\<C 1 -(\ ln \ + £ ¥)^-^ 2Q_1+2e ' for all [± tt), 

fc=n+l 

where the second inequality holds for sufficiently large n and e small. With (|A.3P and As- 
sumption (ii) this yields 

{D- l (h)(D n (S n f) - D n (f))) = o(r n n^). (B.14) 

Decompose T n (/) - D n (f) = (T n (f) ~ D n (S n f)) + (D n (S n f) - D n (f)) + (D n (f) ~ D n (f)). 
Now by Lemma El (iv) and Assumption (iii), (|B.13P follows. 

(III) : Set Ay = Cov(Z)" 1 , A 2 = D^Qi) and B = a 2 n~ 2fi Cov(X) and apply Lemma U Since 
B < Cov(Z) = A J" 1 it is sufficient to show 

( id„ -Al /2 A^Al /2 ) = ( id n -D- l l 2 (h) Cov(Z)D- 1 / 2 (h)) = o(r n ). (B.15) 

By the perfect diagonalization property of Cov(Y) (cf. the remarks in Section [A}, we have 
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Cov(Y) = D n (h - a 2 n- 2 Pf) and 



Cov(Z) = <r 2 n~ 2/3 [Cov(R) + Cov(X',R) + Cov(R,X')] + a 2 n- 2 ^(T n {f) - D n (f)) + D n (h). 
Together with Lemma [6] (iv) , 

( id n -D- l ' 2 {h) Cov{Z)D- l ' 2 {h)) <{ id n -D n {h~ l h)) 

+ n" 4/3 sup /i" 2 (A)||Cov(R) + Cov(X , ,R) + Cov(R,X , )||2 

Ae(0,7r] 

+ n-^(D- 1 (h)(D n (f) -D n (S n f))) 

+ n-^(D-\h)(D n {S n f) - T n (/))). (B.16) 

For the first term note that because of ()A.3[) and < 1 — h/h < 1, 



pn t / 

(id n -D n {l~ 1 h)) = Y J {i-rr 



i=l 



2 



o{r n ). 



The other three terms on the r.h.s. of (|B.16|) can be seen to be of order o(r n ) as well, by 
Assumption [fl Assumption (ii), ()B.14|) and Assumption (iii). Therefore, (|B.15p holds and the 
proof is completed. □ 

Proof of Theorem QJ Part d We check the conditions of Theorem HI Note that the special 
case, i.e. K — a < 1/4 implies together with K— a > f3 that K = and a £ [—1/4, — /?). Hence 
this case only plays a role in Part [2j and Part [3] (in the latter only if K = and a = — 1 /4) . 
All the derived estimates will work for both situations and thus, in the following, we do not 
distinguish between these two cases explicitly. 

Let p n = n 1 ^ 4q ^ ^ K~a^ +S <g r-n f or some 5 > 0. Such a 5 always exists thanks to the 
assumption K — a > (4a + l)/3. This assures that 

(Pn/n)~ 4a ~ 2e = o(r n ) for e small enough. (B.17) 

(i) : By [27], {^k)k £ GM . The second part follows from Lemma [TO 

(ii) : Making use of inequalities (|A.5l) and (IA.6j) . 

n 

-4/3-4a-2+2,V-/ ,. T, xs-2 



n ^ .„ " ' "^(tij,„/l(tij,n)) 
i=l 

Pn r n n 

<n" 4/3 ~ 4a ~ 2+2e [^(^( JL ) 2a+£ n 2 ' 9 ) 2 + ^ (n)4a+2e+2^ 4/ 3 + ^ 

j=l j=p n +l 
<n 4£ (p„)~ 4a - 2e + n 4(^-a-/3)+2 er -4X-l = ^ 
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if e is chosen small enough. 

(in): By Lemma (hi) and (i), LemmaEH and T n (f) = T n (S n f), 

(D- l {h)(D n {S n f)-T n (f))) < WD-^WKD^S^-TM) 
<\\DnHh)\\lo\\D n (Snf) ~ T n (f)\\i $ (v)' 4 "" 2 ^ = °^ n ^ 



if 5 and e are chosen appropriately. The bound for sup Ag ( ,7r] h 2 (A) is the same and this 
completes the proof of the claim. □ 

Proof of Theorem^ Part\^ Let p n = 1, i.e. h(v,i) = h(ui). The proof of this part is similar 
to the one for (i). Again, we check the conditions of Theorem 0J 

(i) : This follows from Lemma [TTJ 

(ii) : Splitting the sum Ya=i = J2l=i + Ya= t +i> we ^ n< ^ f° r small e, by inequalities (|A.5[) and 

dAH), 

n 

n _ 4/ 3-4 Q -2+2e ^2(u hn h( Ul , n ))- 2 < + U^ K ~ «~ ^ T~ 4K ^ = o{r n ). 

i=l 

_ 2K/3 

(Hi): Observe that by (|A.5[) and (|A.6|) . h(X) > n n-a-e/a , Together with Lemma[9l 
( J D- 1 (/ l )( J D n (5 n /)-T n (/)))+ sup /T 2 (A)< sup /i- 2 (A)(|| J D n (5 n /)-r n (/)||^ + l) 

Ae(0,7r] Ae(0,7r] 

□ 

Proof of Theorem^ Part\^ We apply Theorem EJ with f(u%) = f(ui) and h(v,i) = h(v,i), i.e. 
Pn = 1. 

(j^: This follows from Lemma [TU] and Lemma [TTJ 

For the following parts we make frequently use of the inequalities (|A.4jl - (|A.6j) and the subse- 
quent comments. 

( ii): Since u Tn — > 0, we find, if n is sufficiently large, 

n i Tn 4/8 " 1 

V 1 < V n i V 1 < „2+4/3+4a+2e -l-4a , 2+4K -1-4RT 

^ u1h{uiY ~ ^ m 2 /«) 2 ^ - 2 + 4 *~ 



i=r„+l n i 
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Therefore, 



t- 



for e sufficiently small. 

It is straightforward to bound sup Ag ( 0j7r ] h~ 2 (\) by a multiple of n K -° * f -l 2 = o(r n n 4 "), 
which immediately implies that the second term has the right order. However, to show the 
same rate for the first term turns out to be the most difficult part of the proof. Let us shortly 
remark on that. The crucial point is that although we have good control on the spectral 
density h, this gives no direct link to entries of the inverse of D n (h). In contrast to the 
proofs above, estimating (D- x {h){D n {S n f) - T n (f))) by (sup A l/h(X)f\\D n (S n f) - 
is too rough (cf. also Remark [T]). Therefore, we look for a new function, say g, with the 
properties that l/(gh) behaves like a constant for small A and D n (S n g) is explicitly known. 
It turns out that g = fi/ 2 + a is a good choice, where fi/ 2+a denotes the spectral density 
of a fractional Gaussian noise process with Hurst index 1/2 + a < 1/4, cf. Lemma [T2l for 
details. Furthermore, r 1 / 2+a denotes the corresponding autocovariance function. From (|A.5|) 
and (IX6l) . we obtain 

A£ [I )7r ) /l/2+a(A)MA) 

Define 



/:= {D-\hh /2+a )(D n {f 1/2+a ) -D n (S n f 1/2+a ))(D n (S n f) -T n (/))>, 



II 
III 



((D n (S n f 1/2+a ) -T n (f 1/2+a ))(D n (S n f) -T n (/))>, 

<T n (/i /2+a )(£' ri (s'n/)-r n (/))) 



and note that by Lemma (iii) , 

- A (D-\h)(D n {S n f) - T n {f))) < I +\\D- 1 (hf 1/2+a )\\ 2 JII + ///) 

</ + n 4/3+2e (/I + ///). (B.19) 

In the following we will bound the terms /, II and III, separately. 

(I): By (ii) and (it>) of Lemma 1121 we know (ri/ 2+a (k))k £ GM and that fi/ 2 + a behaves like 
a multiple of X~ 2a for A | 0. Using Lemma M (i) and (1B.18|) . 

SUP 1/1/2+0 W - gn/l/2 +a (A)| n2 , +1+6 (, (n)| + g 2 , +2Q+e _ 
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Thus, with (|A.3|) and Lemma [6] (iii) , 

I < \\D- l (hf 1/2+a )(D n (f 1/2+a ) - D n (S n f 1/2+a ))\\l(D n (S n f) - T n (f)) < n 4 ^. 

(II): By Lemma [T2l (i), and the boundedness of the sequence (ri/2+ a (fc))fc, we see that there 
exists a constant C a such that for all k G N, |ri/ 2 + a (fc)| < C a k 2a ~ l . Using Lemma [7] and 

dA3D, 



[(AA/i/a+a) -T n (S n f 1/2+a ))(D n (S n f) -T n (f)) 



h3 



n 



<C a C r (I* + fc| 2a_1 + 1 2n + 2 — A; — i] 2 ^ 1 ) (|j + fc|- 2a - 1+<5 + |2n + 2 - j - k\- 2a - 1+5 ). 
fc=l 

Define := Y2=i I* + ^| 2a_1 |i + A 4-2 "" 1 - It is well-known that if (a fc ) fc and (6 fe ) fc are 

non-negative sequences which are monotone increasing and decreasing, respectively, then 
Yl=i a kbk < J2k=i a kb n +i~k- Thus, 

eij < C a C r (2nf (F(i,j) +F(i,n + 1 - j) + F(n + 1 -i,j) + F(n + 1 -i,n + 1 - j)). 

From the monotonicity of a; i— > x~ 2a_1 and x i— > x 2a ~ l for x > 0, 

n roo 1 

^(i, j) < r 2a ~ x Y\i + k] 2 "' 1 < r 2a ~ x \ x 2a ~ x dx = —r 2a ~ l i 2a . 

fe =i Ji 2 M 

This allows to bound &ijej ; i by a multiple of n 2<E (min(i, n + l — i) min(j, n + 1 — j)) . Hence, 
II<Ht J =ie l , ] e hi <n 2 nog 2 n. 

(Ill): First let us introduce the projection II = II n defined for an n x n matrix A = 
(a>i,j)i,j=i,...,n by (TlA)ij := a^j if i + j < n + 1 and zero otherwise. Further let E de- 

: ;3 

Note that by Lemma (iv 



note the n x n matrix (E)ij := 1 if z + j = ra + 1 and zero otherwise. In particular E? 2 = id n . 



(T n (h/2+a)(D n (S n f)-T n (f))) 

< 2(T n (f 1/2+a )U(D n (S n f) -T n (f))) + 2(T n (f 1/2+a )(id n -n)(D n (S n f) - T n (/))), 
where by a slight abuse of language id n denotes here the identity operator on the space of 
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n x n matrices. To bound the first term, we decompose 

2i-l 



(r„(/i/2+a)n(D n (5„/) - T n (/))). = E ri/2+«(i - A0[7fc+j-i - 7i+j-i] 



'-J 

fe=i 



2i-l 

+ 7i+J-l E r l/2+c*(« - k) 
k=l 

n+l-j 

+ E r l/2+a(i ~ khk+j-l 
k=2i 

=: A 1 (i,j)+A 2 (i,j) + A 3 (i,j), 

with the convention J^kt^^ = ~ 2~ZfcL n +i-j if 2i > n + 1 — j. By assumption |7 9 + p — J p \ < 
l7u+l - 7«l ~ rfqp~ 2a '~ 1 . Hence, uniformly in i, j, 

2i-l 

\Mh j)\ <n e Y, min (N " fc | 2 °. !)i" 2a " 2 £ n e z 2a+1 j- 2a - 2 . (B.20) 
fc=l 

If 2j < i, we can split the sum Y^Z\ = Sfc=i"' ^fc^^J+i ' Then, the first part of j)\ 
can be bounded also by 



L«/2J U/2J 

I E ^1/2+^ - fc) [7fc+i-i - 7i+i-i] I < "^ 2 °- 1 E ( fc + J " 1 )~ 2a ~ 1 % nH ~ l 

k=l k=l 

and the second part is by the same arguments as in (1B.20|1 (j can now be replaced by i + j) 
of the order n £ i 2a+1 (i + j — i) _2a ~ 2 < n e i . Together, this shows that 



\Mhi)\ 



< 



n e i~ l if 2j < i. 



With Lemma [T21 (i) and telescoping, Y^k=i r i/2+a(i — k) = i 2a+1 — (i — \) 2a+l and therefore, 

\A 2 {i,j)\<rf{i + j - I)- 2 "- 1 ?". 
The last term of the expansion can be simply bounded by 



\Mh j)\ <™ e (i+j-ir 2a - 1 E K-^-^n^+i-i)- 2 -- 1 



-2q-1^2o 



k=2i 



In particular, the bounds for A2(i,j) and A3(i,j) are uniformly in i,j as well. Hence by 
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elementary computations 



\(T n (fi/2 + a)(D n (S n f)-T n (f)))\ 

<Y,\Mhj)+Mhj)+Mhj)\\ A ^ U, + ^2 (j, i) + A z (j, i) | < n 2e log 2 



n. 



h3 



Finally, note that ET n (f 1/2+a )E = T n (f l/2+a ) and E ((id n -U)[D n (S n f) -T n (f)])E is a 
matrix with entries — 7i+i for j + j < n and zero otherwise. Therefore, we have by rewriting 

(T n (/i/ 2+Q ) ( idn -n) (D„ (S n f) - T n {f) ) ) 
T n (h/2 +a )EUid n -U)(D n (S n f) - T n (f)) 



E 

the same structure as above (up to an index shift by one) and all arguments apply. This 
shows that 77/ < n 2e log 2 n. 

The estimates in (/), (II), and [III) show that the r.h.s. of (IB.19|) can be upper bounded 
by n 4e+4 ^ log 2 n, and hence Assumption (iii) of Theorem 0] follows by choosing e sufficiently 
small. 

Since we have verified (i),(ii), and (iii) this allows to apply Theorem H] and therefore the 
conclusion of the theorem follows. □ 

B.3 Proof of Theorem [3] 

We use the superscript J, to indicate the approximation of a function at zero, i.e. define for 
AG (0,vr], 

f±°(\):=C a \ 2a e(±), and f°(X) := C a \ 2a t(u^), (B.21) 

with C a = 2 sign(— a)T(— 2a) cos(-7ro:). Similarly 

^O(A) := a 2 f±°(\) + r 2 \ 2K and h X \\) := a 2 /+°(A) + r 2 \ 2K . (B.22) 

In order to prove Theorem O we will show in the following propositions that up to negligible 
terms 







where (r+) n is a generic integer sequence satisfying r n <C r+ -C n. 

Proposition 1. Work under the assumptions of Theorem^ For any non-negative sequence 
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{y n ) n , tending to zero, 



;££$) 2 dA(i+ (i)). 



Proof. By Lemma El /(A) ~ f l °{\) for A 1 0. Since i/ n -»■ 0, it follows from ([Oil . CO) , 
and Taylor expansion of sin 2i ^(-) that 



<? n := sup 



I /(A) 
I /40(A) 



1 + 



MA) 
^«(A) 



Ae(o,i/ n ] 

Therefore, we can estimate 

Jo U W j Jo [ fc W ' 
and the proof is complete, due to 



1| < sup 2\ 

Ae((V„] 



/(A) 



, I , |4 K sin 2if (A/2)-A 2K 
i| + | x?K 



0(1). 



< 3 



/ / i0 (A) ^2 I- ( f^W \ 



/^°(A) s2 
„ ^°(A)' A 



< 3 



(MA)) 2 



(/^°(A)) 2 (M"(A)-/i(A)) M°(A) 



(/ l (A)) 2 (/ i W (A)) 2 
, 2 | MA) 

i j MAH IPW 



f/A 



<12/ (|<iM 



< 12ft, / (|$) 2 <iA. 

i/ 



□ 



Proposition 2. Let (V+) n &e as above. Then 



•"(fi^) 2 « = //M^A + »K,,n«). 



Proof. We define L(-) = (£(-)) 2 , which is again slowly varying. The lemma is proved, once we 
have established that \I + II + iXT + IV \ = o(u rn ), with 



I = 


r u r„ 
Jo 




fUr n 

Jo 




II = 


r u r n 
Jo 




r u r n 
Jo 


h (A) 


III = 


r u t n 

Ju r „ 




J 




IV = 


f U r + 
j 1 n 

Ju rn 




f U r + 
j ' n 

Ju rn 


h (A) 



(I): Note that by substituting s = u Tn \ 1 , 
HI < a 



L(l/A) 
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Since —> oo, we find by Potter's bound that \L{u~^) / L(su~^)\ < 2s 1//2 , for sufficiently 
large n and all s G [l,oo). Therefore, by dominated convergence and the definition of slowly 
varying functions, 

Hm f°° \L(su^)-L(u-^\ s -2 ds = f°° Um \L(su^)-L^)\ s -2 ds = q 
n— >oa L(su rn ) J* n— >oo L(su rn ) 

and therefore I = o{u Tn ). 
(II): Let < ai,a,2,b. Then, 

I 1 1 I _ / 1 , 1 \ [oi— ag] <• tl_ 1 1 \ [ai— 02J _ I 1 _ 1 I 

I (ai+fe) 2 (aJ+Fpl lai+t + a 2 +6/ (a!+6)(a 2 +()) - Ui + a 2 / aia 2 I ^ 

With 01 = a 2 n- 2 ^7 4 °(A), a 2 = a 2 n~ 2 ^f^{\) and 6 = /i+°(A) - a 2 , 
I //I < a" 4 



I tFw - 1 I^ A = ^ y o l(i/a) " ; ' dA = (u r j, 



where the last step follows from (J). 
(Ill): By estimating /i+°(A) > t 2 X 2K , 



\III\ < / r " (CT- 2 n- 2 P\ 2a ~ 2K ) 2 \L(l/\) - L(u-l)\d\ 



C 2 T~ 4 n-^U l r +ia - 4K L( U - 1 ) [ Urt ' Urn X 4*-4K\ L^/X)-L{^) , dx 



< 



1 ^V"r„ 



By arguing similar as in (I) and (iT) , and since 4a — 4iT < — 1 , we see that the integral tends 
to zero and therefore, using Lemma [21 \III\ = o(u Tn ) follows. 

(IV): Using (^°(A))- 1 + (^°(A))- 1 < (j- 2 n 2 P(f X °{\))- 1 +(j- 2 n 2 P{f^{X))- 1 and the elemen- 
tary inequality (ab)~ l < a~ 2 + b~ 2 for a, b > 0, we obtain 

\w\<a- 2 n 2 n {r , {\)Y[^— + ^^ \ iai </a 



/ +U (A) / +U W/ / l +u (A)h4-0(A) 

* ^ v/? P ^C£ l^°( A )) 2 - (7"°( A )) 2 l((/ 40 ( A )r 2 + (7"°(A))- 2 )dA. 

By expanding the last sum and treating each of the two terms separately, we can argue along 
the lines of (III). □ 

Proof of Theorem^ We consider two cases. First assume that K — a > 1/4. Let (r+) n be 
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an integer sequence such that r+ <C r n . Define 

C n ■= n- 2/3 £(u- 1 )2r(-2a)cos(vra) and C' n := {C- l a~ 2 r 2 ) l ^ 2K - 2a \ 
Application of Propositions El [H and [2] and substitution show that 

jT ff^dX = a^n^ £ r " (1 + C~ l a~ 2 T 2 \ 2K ~ 2a Y 2 d\(l + o(l)) + o(r n n^) 

= {C' n )- l a- A n^ [ Ur " /C '\l + X 2K - 2a )- 2 d\{l + o(l)) + o(r n n 4fS ). (B.23) 



Observe that by LemmaEl u r +/C' n -> oo. Since K-a > 1/4 the integral / °° (1 + \ 2K - 2a )- 2 d\ 
is finite and we can approximate the r.h.s. of (jB.23|) further by J Q °°(l+X 2K ~ 2a )~ 2 dX(l+o(l)). 
By using formula (3.251.6) in [TS] 



1 ^ j~r xir ~.:,ix ^- 2) 



+ A 2^- 2Q y K-a Jo (1 + A 2 ) 2 4^ - a) sin^^ - 2)/2) 

and for K — a = 1/2 the r.h.s. is defined by 1. Combining the last identities (recall that for 
the Fisher information, the integrals are scaled by n 1-4 ^/(2-7r)) and application of Lemma [2] 
completes the proof for the first case. 

Now, let us prove the second statement of Theorem [HJ i.e. the case K — a < 1 /4, i.e. K = 
and a £ (—1/4, — f3). With the upper bound in (1A.5|) (extended for all A G (0,ir]), we have 
/(A) < A 2a ~ e and jf /a / 2 (A)(/i~ 2 (A) + l)dA = o(l). Since, l//i - r" 2 = o 2 T- 2 rT 2fi f jh we 
obtain also 

g& - r" 4 / 2 (A) dA < 2a 2 r-"n- 2 ^ (\ f(X)d\ < n 2 ? \\ X^dX = o(l) 

J net Jnoi 

for small e. Combining the last approximations yields 

= ^3-/ f 2 (X)dX + o(n^) = ^- r - £ 7fc 2 + o(n^) 

fc=— 00 

since / = J2kL-oo cos(k-)jk- This completes the proof of (|2.14|) , 

Finally, we evaluate the integral in the critical case, i.e. we prove (|2.15|) . Because of 
log(x)^ 2 (x) — ^ 00 for x — > 00, one can find a non-negative sequence (v n )n, tending to zero, 
such that 

f (§y) 2dA £ r A_1 " 2e(iA £ "»* = o(log(n)t 2 (n^)), (B.24) 

Ji/ n Ju„ 

where we used the global upper bound for /, cf. (IA.5j) . in the first inequality. Since v n tends 
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to zero, Proposition Q] applies. With f i0 and h i0 as defined in (fR2~Tj) and (|R22|) . 

2tt in {W ^' 



ft = ^ T (£M) 2 d\(l + o(l)) + o(n^ log(n) l\n^)) 



(ffi )^A(1 + o(l)) + o^ 1 - 4 ^ log(n)^(n 4 ^)). 



Note that if — a = 1/4 implies a = —1/4. Observe also that by Potter's bound, 
(/+ (A)) 2 |(M°(A)r 2 - r- 4 \d\ < f " (/^°(A)) 3 dA 

For a = -1/4, /+°(A) = ^\- l / 2 l{l/ A). This shows 

7" =n 1 - 4 ^r- 4 /' 1 ^(i)^(l+ (l)) +0 (n 1 - 4 ' 3 log(nK 2 (n 4 / 3 )), 

where the integral f v is estimated as in (IB, 24ft . □ 
B.4 Proof of Corollary [1] 

Proof. For H = 1/2 the result follows from [16] . If ii ^ 1/2, we check the conditions of 
Theorem (2) Note that the three versions correspond to < H < 1/2, 1/2 < H < 3/4, and 
3/4 < if < 1. By using the properties of fGN, listed in Lemma [T21 (for instance in order to 
show J2T=-oo 7fc = for < ii < 1/2, use Lemma [T21 (i)), Versions Q] and [2] are obviously 
fulfilled, but some work has to be done for the third part. 

To bound the increments of the autocovariance function, i.e. |7 p+ i— 7 p | = |r#(p+l)— rn{p)\ < 
r H{p)/p = 7p/p ; define on M + , x h4 g(x) = \x\ 2H . Then there exists £i £ [x,x + 1] and 
£ 2 e [x-1, ac] such that by Taylor expansion g(x + l) = ELo( fc! )~ + 24~V 4) (£i) and 

9i? " 1) = ELo(-l) fc (*0 _1 » (fc) (^ + 24- 1 5 ( 4 )(6). Thus, 

^)=/(g) + ^(s ( %i)W 4) fe)). 

By another Taylor expansion and p > 1, |r_ff(p + 1) — th(p)\ ^ p 2H ~ 3 . □ 

Appendix C Further technicalities 

First, let us summarize some facts about Frobenius norms. 



32 



Lemma 6. Given n x n matrices A — ifli,j)i,j=x,...,ni 

B,C. Let (A) = tr(A 2 ) and denote by 
ll^-lh = j a l j) 1 ^ 2 the Frobenius (or Hilbert- Schmidt) norm. Then, 

(i) {A) < ||-A|L, and equality holds if A is symmetric. 

(ii) A and B are circular commuting, i.e. (AB) = (BA) 
(Hi) If A > and B Hermitian, then, (AB) < ^{"^(B). 
(iv) If C > as well as A and B Hermitian, then 

(C(A + B)) = \\C X ' 2 {A + B)C^ 2 \\ 2 2 < 2(CA) + 2{CB). 

Proof, (i): Due to 2uv < u 2 + v 2 , ti(A 2 ) = (Ey aija jti ) 2 < EijOy = \\A\\l. (Hi): Write 
A = D t AD with A diagonal. □ 

The next result is well-known, cf. Sanchez et al. [25], p. 2635. For sake of completeness we 
give a short proof. 

Lemma 7. Assume that g = J2m=-n 9m cos(m-) with g m = g- m and let D n (g) be defined as 
in (|A.3p , Then, we have for the (i,j)-th entry (D n (g))ij = gu-j\ + 9i+j-i ifi+j<n + l 
and (D n (g))ij = - g 2n +2-i-j if i + j > n + 1. 

Proof. Recall (|A.1[) and t|A.3|) . Expand 

n n 

(- D n(5')) ii = gm^2[cos((i+j-l+m)u k ) + cos((i+j-l-m)u k ) 

m=—n k=l 

+ cos ((i — j + m) u k ) + cos ((i - j — m) u k ) ] 

using g{uk) = J2m=-n 9m cos ( mu k) and 2 cos(x) cos(y) = cos(x+y)+cos(x— y). With the well- 
known summation formula for Dirichlet kernels, we find that for an integer £, J2k=i cos(^Ufc) 
equals l)^/( 2n+1 ), if £ is divisible by 2n+l, and i(— otherwise. If we rewrite in the first 
case n (_i)*/(2«+l) as ( n+ l)(_i^/(2n+l) + i ( _ 1) m j we find that {D n {g)\ 5 = g^ +g i+j -i 
if i + j < n + 1 and (D n (g))ij = g^^ - g 2n +2-i-j i£i + j > n+l. □ 

In the next lemma, we collect a number of properties of general monotone sequences. 
Lemma 8. Assume that (ak)k £ GM and define f = ao + 2^fcLi Ofc cos(Zc-). 

(i) IfJ2k%i k~ 1 \a k \ < co, then there exists a constant C such that 

1 00 I I 

\f( x )-S n f(x)\<C-(\a n \+ ]T i^i), i/x€[i,7r]. 
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(ii) If additionally a k ~ k 2a t(k) holds for a > — 1/2 and a slowly varying function £, 
then {a,k)k is of bounded variation, i.e. J2T=i \ a k+i — «fc| < °o- 

(Hi) If ak ~ k~ 2a ~ 1 £(k) holds for a G ( — 1/2,0) and a quasi-monotone slowly varying func- 
tion £, then 

/(A) ~ 2A 2Q ^(i)r(-2a)cos(TO). 

Proof, (i) and {Hi) can be found in the recent paper by Tikhonov [27J. To verify (ii) note 
that for sufficiently large K, using the definition of GM and (|A.4|h Y1T=K \ a k+i ~ a k\ < 
E^Lo |o2*jcI < 00 • D 

The following lemma gives a very useful bound on the Frobenius norm between the matrices 
T n (f) and D n (f) defined in CO|) and (L\~3|) . 

Lemma 9. Lei / = X)fe^=o a fc cos(fc-) anc? assume that (a k ) G GM . 1/ Efc>i & | a fc| < °°! 
£/ien, i/iere exists a constant C, such that 



OO I I u 

\D n (f)-T n (f)\\ 2 <Cn(\a n \+ £ IM) + ( 2 £ fcajf , 



fc=n+l fe=l 



Proo/. Note that ||Z> n (/) - T n (f) || 2 < ||L> n (/) - D n (S n f) || 2 + ||£> n (S n /) - T n (f) \\ r In order 
to bound the first part, we use the estimate from Lemma [8] (i) together with (|A.3|) and hence 
\\ D n{f) ~ D n{ s nf)\\l = Y,k=i(f( u k) ~ S n f(u k )) 2 for u k as in (|A.1[). Since n fc > 1/n, this 
yields 



|l? n (/)-^(5 n /)|| 2 <C7 1 n(|a n |+ £ 



00 1 1 



k=n+l ft 

for another constant Ci. Using Lemma [7] and (T n (f))ij = fu-j\, the second term can be 
bounded easily by (2 ££=1 kf 2 ) 1 / 2 . □ 

Lemma 10. Assume that (jk)k satisfies the assumptions of Theorem^ VersionlM Then it 
is also in GM . 



Proof. The following arguments are for J large enough. Observe that by Potter's bound, the 
sequence (bj)j, 

-1 2J-1 
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is uniformly bounded. Using also (|A.4|) 



E l7i+i "7il<Ci E - ^ Cibj£(J)J' 2a ~ l < 7J . 



j=j j=j J 



□ 



Lemma 11. Under the assumptions of Theorem^ we have that f is bounded on any interval 
[S, 7r] with S > and 

/(A) ~ 2sign(-a)r(-2a)cos(7ra)A 2a £(l/A). 

Proof. For a € (0, 1/2), the sequence (jk)k is absolutely summable, and hence / is bounded on 
[0, 7r]. The behavior at zero, i.e. /(A) ~ — A 2a ^(l/A)2r(— 2a) cos(7ra), follows from Theorem 
1.2 in [8] and the assumption Efcl-oo 7fc = 0. For a G (-1/2,0), we have (7*.)* G GM 
(cf. Lemma [TO]) . Hence, by Lemma [S] (ii), (jk)k is of bounded variation and therefore (cf. 
Zygmund [25], Theorem 2.6) / is bounded on [5,ir] for every 5 > 0. The second property is a 
consequence of Lemma [5] (hi) and Assumption [TJ □ 

Proposit ion 3. Work under the assumptions of Theorem [^| with K — a > 1/4 crod /ei (?"^)n 
be an integer sequence satisfying r n <C <C n. Then, 



n 



£ (^) 2 d\ = o(r n n^) and ± (gg) 2 = o(r^). 



Proof. Because of IT — a > 1/4, there exist <5' > and an integer no such that by Potter's 
bound we find for all n larger than no, 

) 



< 2 / u^- Q - 1 / 4 , for all /i G (0, S/u r+ ]. 



t(— ) 

Let (5 be as in ([AT5l) . Together with (jO]l . we obtain 



< 2-4Tt~*u 

< 4^ +1 r- 2 n 2 r 2i V-*-V4g(_L_) ; for aU ^ G ( , (J A <5')/ 



1/ . 



Since / is bounded on [5 AS', tt] under the assumptions of Theorem [5] (cf. Lemma [TT|). we also 
have /(A)//i(A) < \~ 2K , for all A G (5 A <5', 7r]. Now, we are ready to prove the two statements 
(denoted by (I) and (II)) of the proposition. 
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(I) : Decomposing = J^ AS + Jg AS , and substituting /i = A/u r + in the first integral yields 

r 71 1 r(8A8')/u + / /(u + u) N 2 t"K 

/. + = ^(^) * + 

< 4 2A' + 2 r -4 n 4 r 4^ + l £ 2 ( ^ ) ^ ^-2^-1/2^ + o^^). 

Note that for two positive sequences (a n ) n and (6 n ) n with a n <C b n , and any k > 0, we have 
for sufficiently large n, 

Apply this for a n = l/u r + and b n = l/u Tn together with Lemma [2J Then, 

\t K)+Y ^ «ut- K)+i n^) = o(u r y?) 

and this completes the proof of (J). 

(II) : Let K n be the largest integer, s.t. u Kn < 5 AS'. Similarly as for (/), we rewrite 
Note that 

^ J2 (^-) 2a - 2K - 1/2 < J2 ffl 2a ~ 2K ~ 1/2 < / (2A) 2 - 2 ^ 1 / 2 ( iA < oo. 



n. 



Arguing as above yields the required result. □ 
Finally, we summarizes a number of facts on fractional Gaussian noise. 

Lemma 12. Let (iV^r )fceN denote a fractional Gaussian noise process, i.e. a centered Gaus- 
sian process with covariance 



Cov(iVf ,iVf ) = U\k - j + 1| 2 " - 2|* - J\ 2H + \k - j - II 2 "). 



Denote by fu and rn the spectral density and the autocovariance function, respectively. Then, 

(i) r H {k) = \{\k + l\ 2H - 2\k\ 2H + |ife - 1| 2H ) ~ 2H(2H - l)k 2H ~ 2 . 
(H) (rff(^))fcez is O-regularly varying quasi-monotone and (therefore also) in GM 
(Hi) /h(A) = 2sin(7r#)r(2im)(l-cosA)£!*L t30 |A+27rj|- 2H - 1 ~ sm(TrH)T(2H+l)\ 1 - 2H 
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(iv) There exist positive constants Cu and C' H , such that Cj^A 1 2H < ///(A) < C'jjX 1 2H 
for all A G (0,ir}. 

Proof, (i), (iii) and (iv) are well-known (cf. [3]). (ii) follows from the monotone decrease of 
(r H (k)) k . □ 
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